Pitman–Yor process

In probability theory, a Pitman–Yor process[1] [2] ,[3] denoted PY(dθG0), is a stochastic process whose sample path is a probability distribution. A random sample from this process is a finite-dimensional Pitman–Yor distribution, named after Jim Pitman and Marc Yor. Unfortunately, there is no known analytic form for this distribution.

The parameters governing the Pitman–Yor process are: 0 ≤ d ≤ 1 a discount parameter, a strength parameter θ > −d and a base distribution G0 over a probability space  X. When d = 0, it becomes the Dirichlet process. The discount parameter gives the Pitman–Yor process more flexibility over tail behavior than the Dirichlet process, which has exponential tails. This makes Pitman–Yor process useful for modeling data with power-law tails (e.g., word frequencies in natural language).

See also

References

  1. ^ Teh, Yee Whye (2006). "A hierarchical Bayesian language model based on Pitman–Yor processes,". Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics,. 
  2. ^ Pitman, Jim; Yor, Marc (1997). "The two-parameter Poisson–Dirichlet distribution derived from a stable subordinator". Annals of Probability 25 (2): 855–900. doi:10.1214/aop/1024404422. MR1434129. Zbl 0880.60076. 
  3. ^ Pitman, Jim (2006). Combinatorial Stochastic Processes. Berlin: Springer-Verlag. http://works.bepress.com/jim_pitman/1/.